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I . Enter input sequence of desired length 



2. Enter window size 
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3. Axis chosen 3 
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4. Calculate and output polarity value b 

5. Move to next axis ^ 
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6. Calculate and output polarity value around new axis, then. . . c 
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7. Output ordered list of polarity values 
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8. Graph these values d 

9. Statistical analysis of observed vs. predicted 

10. Identify regions of extended polarity 



a Starting at position = (2*window of symmetry) 
b [I-(S/W)] 

0 Up to and including axis position = [2*length- (2*window size)] 

d Can use a moving average of values (with number of values averaged and increment of 

moving being variable) to smooth curve 
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The aigorythm was implemented in PERL programing language. 
PERL variable-names and function-names are in boldface. 




import input sequence as a string variable $tnput_seq ) 




prompt user for ($win_sym ) length 



cut a length of (2*$win_sym ) bases from the 5' end of$input_seq and assign this 
substring into $win_seq 
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perform 
(2*win_sym) 
iterations 



chop (cut last character off a string and return it$win_seq 



unshift chopped character (prepend it to the front of an array, moving all elements one step to 
the right) into an initially empty indexed array @trgt_fwd ) 



perform 
(2*win_sym 
iterations 



translate $win_seq so that every G or A or T or C of the original string 
become C or T or A or G respectively of the translated string 



chop the translated $win_seq 



push chopped character (stack it onto the end of an array, without moving the other elements) 
onto an initially empty indexed array @trg t_re vc o mp ) 



assign $match_count = 0 



assign index variable $i = 1 



perform 
(win_sym, 
iterations 



compare between the $i th elements of array s@trgt_fwd and @trgt_revcomp ; if equal then 
increase $ match count by 1 , else nothing. 



increase 



$i by 1] 



calculate $asym_count - 1 - ($match_count / win sym) 



push $asym_count onto initially empty indexed array@axts_jist 
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► while $input_seq is not empty 



cut one base off of the 5* end of input_seq and assign it into$basefeed 



translate $basefeed GATC~>CTAG respectively and assign the 
translated character into$basefeed_comp 



unshift $basefeed_comp (prepend it to the front of an array, moving all elements one 
step to the right) into array @trgt_revcomp - 



pop (remove last element of an array@trgt_revcomp 



assign $match_count = 0 



assign index variable $i = 1 



perform 
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iterations 



compare between the $i th elements of arrays@trgt_fwd and @trgt_revcomp ; if equal then 
increase $matcti_count by 1 , else nothing. ___ ■ 



increase $i by 1 



calculate $asyrn_count = 1 - ($match_count / win sym) 
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push $asym_count onto @axis_iist 



shift (remove first element of an array and move all elements one step 
leftward) @trgt_fwd 



push $basefeed onto a rray @trgt_fwd 



assign $match_count = 0 



assign index variable $i = 1 



perform 
(win_sym] 
iterations 



compare between the $i th elements of arrays@trgtjwd and @trgt_revcomp ; if equal then 
increase $match_count by 1 , else nothing. . ■ — 



increase $i by 1 



calculate $asym_count = 1 - ($match_count / win sym) 



push $asym_count onto @axis_Jist 



save @axis list to file 
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104 Count DPT frequencies, calculate and output DPT residuals and chi square value 
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105 and 105* Move to next axis 



106 and 106 1 Count DPT frequencies, calculate and output DPT 
residuals and chi square value around new axis, then. ,. b 



107 Save to file: ordered arrays of DPT frequencies, DPT residuals and chi square values 
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108 Further statistical analysis of observed vs. predicted 



109 Graph values* 



1 10 Identify functional elements 



Starting at axis position = (2* window size) 

Up to and including axis position = [2*length - (2*window size)] 

Values include DPT frequencies, statistical measures including residuals and 
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